Determining the Optimal File Size on Tertiary Storage Systems Based on the Distribution of Query Sizes

Authors

  • Luis M. Bernardo
  • Henrik Nordberg
  • Doron Rotem
  • Arie Shoshani
Abstract

In tertiary storage systems, data is stored on multiple tape volumes, and each tape is further divided into files. Since in many such systems the minimum unit of data transfer is a file, it is important to match file sizes to the access patterns of the data. In general, a file size that is large relative to the query size leads to the transfer of a large amount of irrelevant data, whereas small file sizes incur an overhead penalty associated with reading each new file. In this work, we analyze the relationship between file sizes and query response times and provide a methodology to compute the optimal file size given information about the distribution of query sizes. Exact closed form solutions for the cost function are given for two common...
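The trade-off described in the abstract can be illustrated with a small numerical sketch. This is not the cost function from the paper, which is not reproduced on this page. It is a toy model under stated assumptions: every query starts on a file boundary, a query of size q forces the transfer of ceil(q / f) whole files of size f, and each file read costs a fixed overhead plus f divided by the transfer rate. The function names, parameters, and sample numbers are all hypothetical.

```python
import math

def expected_cost(file_size, query_sizes, probs, overhead, rate):
    """Expected response time of one query under the toy model.

    file_size and query_sizes share one unit (e.g. MB), overhead is in
    seconds per file, and rate is in that unit per second.
    Assumption: queries are aligned to file boundaries.
    """
    total = 0.0
    for q, p in zip(query_sizes, probs):
        n_files = math.ceil(q / file_size)  # only whole files can be transferred
        total += p * n_files * (overhead + file_size / rate)
    return total

def best_file_size(candidates, query_sizes, probs, overhead, rate):
    """Return the candidate file size with the lowest expected cost."""
    return min(
        candidates,
        key=lambda f: expected_cost(f, query_sizes, probs, overhead, rate),
    )

# Hypothetical workload: 80% of queries need 100 MB, 20% need 1000 MB.
sizes, probs = [100, 1000], [0.8, 0.2]
for f in (10, 100, 1000):
    print(f, expected_cost(f, sizes, probs, overhead=10.0, rate=10.0))
print("best:", best_file_size([10, 100, 1000], sizes, probs, 10.0, 10.0))
```

In this toy workload the smallest files are dominated by per-file overhead and the largest files by irrelevant data, so the middle candidate has the lowest expected cost. A grid search like this only approximates the optimum over the candidates supplied; the paper reports exact closed form solutions for its own cost function.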


Similar resources

Determining the Optimal File Size on Tertiary Storage Systems Based on the Distribution

In tertiary storage systems, the data is stored on multiple tape volumes where each tape is further divided into files. Since in many such systems the minimum unit of data transfer is a file, it is an important problem to match file sizes with the access patterns to the data. In general, if the file size is large relative to the query size it will lead to the transfer of large amount of irrelevant data...

Optimal energy management of the photovoltaic based distribution networks considering price responsive loads, energy storage systems and convex power flows.

Nowadays, the presence of photovoltaic systems in distribution networks is not without challenges, and without optimal management it may not be economically productive for the system. Energy storage systems are able to cope with this problem. Therefore, in this paper, a new method is proposed for energy management of distribution networks in order to show how the presence of the energy ...

Patterns to Partition Large Datasets on Tertiary Storage in order to Minimize Retrieval Costs

In tertiary storage systems, the data is stored on multiple tape volumes where each tape is further divided into files. In general, if the file size is large relative to the query size it will lead to the transfer of large amount of irrelevant data whereas small file sizes will incur an overhead penalty associated with reading each new file. In this work, we analyze the relationship between file sizes an...

MEDICAL IMAGE COMPRESSION: A REVIEW

In recent years, the use of medical images for diagnostic purposes has become a necessity. Limited transmission and storage capacity, together with the growing size of medical images, has created the need for efficient methods; image compression is therefore required as an efficient way to reduce irrelevant and redundant image data so that the data can be stored or transmitted. It also reduces...

Optimal Placement of Substations Based on Economic and Technical Risk Management

The design and expansion of distribution systems seems inevitable given the need to satisfy rising energy consumption in a technically and economically sound way. Optimal location, sizing, and determination of the service area of substations is one of the principal problems in the expansion of distribution systems. Uncertainty is also one of the important factors that increase the risk in exact decision making....

Publication date: 1998